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METHODS AND APPARATUS FOR MOLECULAR DATA STORAGE, 

RETRIEVAL AND ANALYSIS 

FIELD OF THE INVENTION 

5 The invention is in the field of molecular-scale devices for analysing 

molecules, and methods of using such devices. More particularly, it relates to 
molecular information storage and information retrieval systems. 

BACKGROUND OF THE INVENTION 

1 o It has long been known that DNA is an efficient means for storing vast 

amounts of information in a very small space. DNA as it is stored inside somatic 
p cells may be thought of as read-only memory (ROM), inasmuch as the information 

P stored in these DNA molecules is generally not •written'. Biochemical methods for 

sf "writing" (assembling oligonucleotides) and reading (sequencing) DNA have thus 

^ 15 far not been easily adapted to non-biological memory devices. 

f Alternative methods are known for using polymers for electronic molecular 

g memories (see Heller and Tu, US Patent No. 6,067,246) . For example, Hopfield et 

H a/, disclose a polymer based shift register memory which incorporates charge 

§• 20 transfer groups. Others have proposed an electronic memory device in which DNA 

fU is used with electron conducting polymers (Robinson et a/.). 

There are many applications for methods that may be used to read the 
biological information recorded by DNA. For example.methods for the detection of 

25 single nucleotide polymorphisms (SNP's) in nucleic acid sequences have many 
important applications, potentially including mutagenicity assessment of drugs and 
chemicals, measuring environmental levels of genotoxins, cancer cell detection and 
methods of screening for genetic diseases. SNPs may be detected as base-pair 
mismatches in a DNA duplex. Present methods for the detection of base-pair 

30 mismatches primarily involve biochemical sequencing techniques. An innovative 
method has been described for using a hemolysin nanopore to detect the blockade 
events caused by single DNA hairpin molecules to provide sequence information at 
a single nucleotide resolution (Vercoutere etai). Similar methods have been used 
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that employ a DNA oligonucleotide tethered within the lumen of a hemolysin 
nanopore to detect differences in ionic current flow through the pore (Howorka et 

a/.). 

Nanopores have previously been used for the characterization of polymeric 
molecules. For example, U.S. Patent Nos. 6,267,872; 6,015,714 and 5,795,782 
(incorporatednierein by reference) disclose methods and apparatus for the 
characterization of polymeric molecules such as nucleic acids using thin film 
supports containing ion-permeable nanometer-scale channels, frequently called 
nanopores. There have been numerous reports describing methods for the rapid 
characterization of polynucleotides passing through a lipid bilayer by measuring 
ionic current changes at nanopores (Kasianowicz et al.; Akeson et al.; Meller et al.; 
Howorka et aL; Vercoutere et al.; Howorka et al. Nat. Biotech.; all of which are 
incorporated herein by reference). 

A new form of conductive nucleic acid, called M-DNA, has recently been 
described in which the imino protons of each base pair may be replaced by a metal 
ion such as Zn 2+ , Ni 2+ or (^(International Patent Publication WO 99/31 115; Aich 
et al.', and Rakitin et aL; all of which are incorporated herein by reference). 

SUMMARY OF THE INVENTION 

In one aspect, the invention provides processes for recording information in 
a polymer, such as a nucleic acid polymer. The invention also provides devices for 
storing information, such as a device comprising a metal-containing nucleic acid 
duplex housed in the lumin of a channel formed in a membrane. Processes for 
using such devices may involve providing a channel separating a hybridization 
medium and a dissociation medium. The channel may be dimensioned to allow 
lineal translocation of a nucleic acid duplex or a metal-containing nucleic acid 
duplex between the hybridization medium and the dissociation medium. A first 
strand of nucleic acid may be provided with a second strand of nucleic acid, the first 
and the second nucleic acid strands comprising a plurality of nitrogen-containing 
aromatic bases covalently linked by a backbone. The nitrogen-containing aromatic 
bases of the first nucleic acid strand may be capable of being joined by hydrogen 
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bonding in the hybridization medium to the nitrogen-containing aromatic bases of 
the second nucleic acid strand, so that the nitrogen-containing aromatic bases on 
the first and the second nucleic acid strands form hydrogen-bonded base pairs in 
stacked arrangement in the nucleic acid duplex. The hydrogen-bonded base pairs 
may be capable of interchelating a divalent metal cation coordinated to a nitrogen 
atom in one of the aromatic nitrogen-containing aromatic bases to form the metal- 
containing nucleic acid duplex. 

In some embodiments, information may be recorded in the nucleic acid 
polymer by modulating the translocation of the first and second strands of nucleic 
acid through the channel between the dissociation medium and the hybridization 
medium, while modulating the electrostatic potential across the channel. In this 
way, the incorporation of the divalent metal ion in the nucleic acid duplex may be 
modulated as the duplex forms in the hybridization medium. Information may be 
read from the nucleic acid polymer by detecting the presence or absence of the 
divalent metal cation in the nucleic acid duplex. The presence or absence of the 
divalent metal cation may for example be detected by measuring the electrical 
conductance across the channel as the nucleic acid duplex is translocated through 
the channel between the hybridization medium and the dissociation medium. The 
nucleic acid duplex may for example be coupled to a magnetic bead, and the 
translocation of the nucleic acid duplex through the channel may be mediated by 
modulating a magnetic field across the channel. 

In alternative embodiments, the channel may be formed in a lipid membrane, 
for example using a pore forming protein. The hybridization medium and the 
dissociation medium may be electrically conductive aqueous solutions. The 
polymer may be a deoxyribonucleic acid, and the nitrogen-containing aromatic 
bases may be selected from the group consisting of adenine, thymine, guanine and 
cytosine. The process of any one of claims 1 through 8 wherein the divalent metal 
cation is selected from the group consisting of Zn 2+ , Co 2+ , and Ni 2+ . The divalent 
metal cations may be substituted for imine protons of the nitrogen-containing 
aromatic bases, and the nitrogen-containing aromatic bases are selected from the 
group consisting of thymine and guanosine. At least one of the aromatic nitrogen- 
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containing aromatic bases may be thymine, having an N3 nitrogen atom, and the 
divalent metal cation may be coordinated by the N3 nitrogen atom. At least one of 
the aromatic nitrogen-containing aromatic bases may be guanine, having an N1 
nitrogen atom, and the divalent metal cation may be coordinated by the N1 nitrogen 
5 atom. 



In some embodiments, the invention utilises a polymer capable of selectively 
binding a metal ion when single strands of the polymer join to form a duplex. In 
some embodiments, adjacent media- compartments may be provided separated by 
10 a non-conducting membrane, such that the polymer favours the formation of single 
strands on one side of the membrane and duplex formation is favoured on the other 
p side of the membrane. The solution that favours the formation of single stranded 

Q polymers also contains metal ions in solution capable of diffusing through the 

SI solution. The membrane also defines a nanopore sufficient to accommodate the 

p 15 length-wise passage of the polymer through the nanopore one base-pair at a time. 

In the 'write 1 mode the polymer passes from the solution favouring single strand 
* formation through the pore into the solution favouring duplex formation. As the 

® polymer passes through the pore in the write mode a metal ion can be selectively 

r* incorporated at each base-pair position by applying an appropriate potential to the 

■Q 20 pore. An alternative potential may be applied to repel the metal ion, facilitating the 
^ formation of base pairs that lack a metal ion. In the 'read 1 mode the polymer 

passes from the solution favouring duplex formation through the pore into the 
solution favouring single strand formation. The nanopore in the read mode 
provides a means for detecting the presence or absence of a metal ion in the base 
25 pairs of the duplex as the polymer passes through the pore. 



In an alternative aspect, the invention provides processes for detecting a 
base pair mismatch is a nucleic acid polymer. In such processes, a channel may be 
provided separating a first pool and a second pool of a medium. The channel may 
30 be dimensioned to allow sequential monomer-by-monomer lineal translocation of a 
polymer, such as a metal-containing nucleic acid duplex, between the first and 
second pools of the medium. Where the polymer is a metal-containing nucleic acid 
duplex, the nucleic acid duplex may comprise a first strand of nucleic acid and a 
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second strand of nucleic acid, the first and the second nucleic acid strands 
comprising a plurality of nitrogen-containing aromatic bases covalently linked by a 
backbone. At least some of the nitrogen-containing aromatic bases of the first 
nucleic acid strand may match the nitrogen-containing aromatic bases of the 
5 second nucleic acid strand, and the matching base pairs may be joined by 

hydrogen bonding to the nitrogen-containing aromatic bases of the second nucleic 
acid strand. The matching nitrogen-containing aromatic bases on the first and the 
second nucleic acid strands may form hydrogen-bonded base pairs in stacked 
arrangement along the length of the nucleic acid duplex. The nucleic acid duplex 
10 may be subjected to a basic solution in the presence of a divalent metal cation, 
under conditions effective to form a metal-containing nucleic acid duplex. The 
matching hydrogen-bonded base pairs of the metal-containing nucleic acid duplex 
jj may comprise an interchelated divalent metal cation coordinated to a nitrogen atom 

SJ in one of the aromatic nitrogen-containing aromatic bases, and wherein a 

III 

p 15 mismatched base pair does not interchelate a divalent metal cation. The metal- 
^ containing nucleic acid duplex may be translocated through the channel from the 

s first pool to the second pool, and the presence or absence of divalent metal cations 

in the base pairs of the nucleic acid duplex may be detected by measuring the 
electrical conductance across the channel as the metal-containing nucleic acid 
q 20 duplex is translocated through the channel between the first and second pools. 

m 



BRIEF DESCRIPTION OF THE DRAWINGS 

25 Figure 1 is a schematic diagram showing the 'write' mode in which 

information is recorded in a nucleic acid molecule in one embodiment of the 
invention. 

Figure 2 is a schematic diagram showing the 'read' mode in which 
30 information is retrieved from a nucleic acid molecule in one embodiment of the 
invention. 
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Figure 3 is a schematic diagram showing an implementation of the 
processes of the invention for base pair mismatch detection. 

Figure 4 is a pictorial representation of a putative modeled structure of M- 

5 DNA. 

Figure 5 is a pictorial depiction of a putative base pair scheme for M-DNA 
shown in Figure 4, according to an alternate embodiment of the invention. 

1 0 Figure 6 is a pictorial depiction of a putative base pairing scheme for M-DNA 

shown in Figure 4, according to an alternate embodiment of the invention. 

m 

Jj DETAILED DESCRIPTION OF THE INVENTION 

Nl As shown in Figure 1 , in one aspect the invention provides systems for 

hi 

p 15 recording information in a polymer. The schematic diagram of Figure 1 shows the 
ji 'write' mode of one embodiment, illustrating a nucleic acid strands being 

pulled (moving to the left in the Figure) into a duplex favoring hybridization 



solution. As illustrated, the translocation of the nucleic acid is mediated by a 



H magnetic field acting on a magnetic bead attached to the nucleic acid. During 

0 20 sequential monomer-by-monomer lineal translocation of the nucleic acid polymer, 



the potential across the membrane may be modulated to control whether a 
metal ion is inserted into the helix. If a metal ion is permitted to enter the 
duplex, a metal-containing base pair is formed. If metal ions are excluded from 
the duplex, a non-metal-containing base pair is formed. Information is thereby 
25 stored in the duplex in the form of the presence or absence of metal ions in the 
duplex. 

As shown in Figure 2, in one aspect of the invention, information may be 
retrieved or read from polymers in a process that is in some respects a 
30 reversal of the foregoing process of writing information to the polymer. Figure 
1 is a schematic diagram showing the 'read 1 mode of one embodiment of the 
invention, in which the polymer, such as DNA, is pushed (translocated to the right 
in the Figure) into a single strand favoring 'dissociation' medium, such as an 
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aqueous solution. As illustrated, the translocation of the polymer Is mediated by a 
magnetic field acting on a magnetic bead attached to the polymer. As each base 
pair passes through the pore, a detector senses whether a metal ion is present or 
absent in the base pair. For example, U.S. Patents 5,795,782 and 6,015,714 
describe a variety of methods for characterization of polymers which are illustrative 
of alternative detection methods that may be used in the present invention. 

An alternative aspect of the invention is illustrated in Figure 3, which is a 
schematic diagram showing a system for detecting base pair mismatches, such as 
are present in single nucleotide polymorphisms (SNP). In such embodiments, a 
DNA duplex, such as a duplex formed by hybridization of a probe and a target 
sequence, is passed through a pore or channel to detect the presence or 
absence of metal ions along the duplex. In such embodiments, conditions are 
provided so that as the duplex forms metal ions are incorporated at each 
matching base pair, and are not incorporated where a base-pair mismatch 
occurs. The detection of base pair mismatches is functionally similar to the 
'read 1 mode shown in Figure 2, with the presence or absence of a metal ion 
being detected as the duplex passes through the channel. 

In some embodiments, the invention utilises nucleic acids that are related in 
structure to conductive metal-containing oligonucleotides as disclosed in 
International Patent Publication WO 99/31115, such as a metal-containing DNA 
duplex ("M-DNA") shown at 30 in Figure 4. As illustrated, M-DNA 30 comprises a 
first strand of nucleic acid 32 and a second strand of nucleic acid 34. The first 32 
and the second 34 nucleic acid strands include a plurality of nitrogen-containing 
aromatic bases 35 and 36, respectively, covalently linked by a backbone 38. The 
nitrogen-containing aromatic bases 35 of the first nucleic acid strand 32 are joined 
by hydrogen bonding to the nitrogen-containing aromatic bases 36 of the second 
nucleic acid strand 34. The nitrogen-containing aromatic bases 35 and 36 on the 
first 32 and the second 34 nucleic acid strands, respectively, form hydrogen 
bonded base pairs 40 in stacked arrangement along the length of the metal- 
containing oligonucleotide duplex 30. The hydrogen-bonded base pairs 40 include 
an interchelated divalent metal cation 42 coordinated to a nitrogen atom in one of 



the nitrogen-containing aromatic bases 35 or 36. In the illustrated embodiment the 
first and second nucleic acid strands 32 and 34 respectively are deoxyribonucleic 
acid and the nitrogen-containing aromatic bases 35 and 36 are selected from the 
group consisting of adenine, thymine, guanine and cytosine. 

Alternatively, other backbone structures 38 may be effective to appropriately 
align the aromatic nitrogen-containing bases 35, 36 in a stacked arrangement 
capable of chelating metal ions 42 and conducting electrons. For example, 
phosphoramide, phosphorothioate, phosphorod'rthioate, O-methylphosphoroamidite 
or peptide nucleic acid linkages may be effective to form such a backbone. 
Similarly, other components of the backbone 38 may vary in accordance with the 
invention, encompassing the deoxyribose moieties, ribose moieties, or 
combinations thereof. 

The nitrogen-containing aromatic bases 35 and 36 may be those that occur 
in native DNA and RNA, namely adenine, thymine, cytosine, guanine or uracil, or 
variants thereof such as 5-fluorouricil or 5-bromouracil. Alternative aromatic 
compounds may be utilized, such as aromatic compounds capable of interchelating 
a divalent metal ion coordinated to an atom in the aromatic compound, and capable 
of stacking, to produce a metal-containing oligonucleotide duplex. Alternative 
aromatic compounds may for example include: 4-acetyicytidine; 5- 
(carboxyhydroxymethyl) uridine; 2 -O-methylcytidine; 5-carboxymethylaminomethyl- 
2-thiouridine; 5-carboxymethylaminomethyluridine; dihydrouridine; 2 J -0- 
methylpseudouridine; beta, D-galactosylqueuosine; 2-O-methylguanosine; inosine; 
N6-isopentenyladenosine; 1-methyladenosine; 1-methylpseudouridine; 1- 
methylguanosine; 1-methylinosine; 2,2-dimethyIguanosine; 2-methyIadenosine; 2- 
methylguanosine; 3-methylcytidine; 5-methylcytidine; N6-methyladenosine; 7- 
methylguanosine; 5-methylaminomethyluridine; 5-methoxyaminomethyl-2- 
thiouridine; beta, D-mannosylqueuosine; 5-methoxycarbonylmethyl-2-thiouridine; 5- 
methoxycarbonylmethyluridine; 5-methoxyuridine; 2-methyithio-N6- 
isopentenyladenosine; N-((9-beta-D-ribofuranosyl-2-methylthiopurine-6- 
yl)carbamoyl)threonine; N-((9-beta-D-ribofuranosylpurine-6-yl) N- 
methycarbamoy1)0ireonine; uridine-5-oxyacetic acid-methylester; uridine-5- 
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oxyacetic acid; pseudouridine; queuosine; 2-thiocytidine; 5-methyl-2-thiouridine; 2- 
thiouridine; 4-thiouridine; 5-methyIuridine; N^(9-beta-D-ribofuranosylpurine-6-yl) - 
carbamoyl) threonine; 2-0-methyJ-5-methyluridine; and 2*-0-methyluridine; 3-(3- 
amino-3-carboxy-propyl) uridine; hypoxanthine, 6-methyladenine, 5-me pyrimidines, 
particularly 5-methylcytosine (also referred to as S-methyl-Zdeoxycytosine and 
often referred to in the art as 5-me-C), 5-hydroxymethylcytosine (HMC), glycosyl 
HMC and gentobiosyl HMC, as well as synthetic nucleobases, e.g., 2- 
aminoadenine, 2-thiouracil, 2-thiothymine, 5-bromouracil, 5-hydroxymethyluraciI, 8- 
azaguanine, 7-deazaguanine, N 6 (6-aminohexyl)adenine and 2,6-diaminopurine. 

Oligonucleotides of the invention may include those containing modified 
backbones, for example, phosphorothioates, phosphotriesters, methyl 
phosphonates, short chain alkyl or cycloalkyl intersugar linkages or short chain 
heteroatomic or heterocyclic intersugar linkages. In some embodiments, the 
phosphodiester backbone of the oligonucleotide may be replaced with a polyamlde 
backbone, the nucleobases being bound directly or indirectly to the aza nitrogen 
atoms of the polyamide backbone (Nielsen et ah, Science, 1991, 254:1497), 
Oligonucleotides may also contain one or more substituted sugar moieties, such as 
moieties at the 2' position: OH, SH, SCH 3 , F, OCN, OCH 3 OCH 3l OCH 3 0(CH 2 ) n , 
CH 3 , 0(CH 2 ) n , NH 2 or 0(CH2>n, CH 3 where n may for example be from 1 to about 
10; Ci to C10 lower alkyl, alkoxyaikoxy, substituted lower alkyl, alkaryl or aralkyl; CI; 
Br, CN; CF 3 ; OCF 3 ; O-, S-, or N-alkyl; O-, S~, or N-alkenyl; SOCH 3 ; S0 2 CH 3 ; 
ON0 2 ; N0 2 ; N 3 ; NH 2 ; heterocycloalkyl; heterocycloalkaryl; aminoalkylamino; 
polyalkylamino; substituted silyl; an RNA cleaving group; a reporter group; an 
intercalator; and other substituents having similar properties. Similar modifications 
may also be made at other positions on the oligonucleotide, particularly the 3 1 
position of the sugar on the 3' terminal nucleotide and the 5' position of 5' terminal 
nucleotide. Oligonucleotides may also have sugar mimetics such as cyclobutyls in 
place of the pentofuranosyl group. Oligonucleotides may also include, additionally 
or alternatively, nucleobase (often referred to in the art simply as "base") 
modifications or substitutions. 



Referring to Figures 5 and 6, a pictorial depiction of base-pairing schemes 
for some embodiments of M-DNA is shown generally at 52 and 54. In some 
embodiments, as for example illustrated in Figure 4, the estimated spacing 
between the divalent metal ions 42 may be about 3, 4 or 5 A (Angstroms). A first 
embodiment shows a thymine-adenine base pair 52 in Figure 5 and a second 
embodiment shows a cytosine-guanine base pair 54 in Figure 6. In both of these 
embodiments 52 and 54, the divalent metal cation is zinc 42. Alternatively, the 
divalent metal cation 42 may be selected from the group consisting of zinc, cobalt 
or nickel. Alternative divalent metal ions may be utilized depending upon the ability 
of the ions to participate with the other substituents of the molecules of the 
invention in the formation of a metal-containing oligonucleotide duplex. 

Referring to Figure 5, in the thymine-adenine base pair 52, one aromatic 
nitrogen-containing aromatic base is thymine 55 which possesses an N3 nitrogen 
atom 60. The divalent metal cation zinc 42 is coordinated by the N3 nitrogen atom 
60 of thymine 55, where the divalent metal cation zinc is substituted for an imine 
proton of the nitrogen-containing aromatic base. In an alternate embodiment 
shown in Figure 6, in the cytosine-guanine base pair 54, the guanine 58 nitrogen- 
containing aromatic base has an N1 nitrogen atom 62 and the divalent metal cation 
zinc 42 is coordinated by the N1 nitrogen atom. Alternatively, the divalent metal 
ration 42 may be complexed between aromatic moieties in alternative 
conformations. In some embodiments, as illustrated, the imino protons of each 
base pair may be replaced by a metal ion. 

In some embodiments, M-DNA 30 may be formed from B-DNA by the 
addition of metal ions, such as 0.1 mM Zn 2+ or mM NiCI 2 at an approximate pH, 
such as a pH of 9.0. There may be a concomitant release of protons, so that a 
base such as KOH may be added to maintain the pH at a desired level, such as at 
8. The conditions necessary to form M-DNA 30 will vary depending on the metal 
ion 42 or ions used and the nature of the nucleic acid 32 and 34. Routine assays 
may be carried out to determine appropriate conditions for metal-containing duplex 
formation, for example by varying parameters such as pH, nucleic acid 
concentration, metal ion concentration, and the ratio of the metal ion concentration 
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to the nucleic acid concentration. In some embodiments, a pH equal to or greater 
than 7, 7.5, 8, 8.5 or 9 may be required, and a suitable nucleic acid to metal ion 
ratio may be from about 1:1.5 to about 1:2.0. 



5 In alternative embodiments, metal cations for incorporation into a metal- 

containing duplex of the invention may be selected from the group consisting of the 
cations of Li, Be, Na, Mg, Al, K, Ca, Sc, Ti, V, Cr, Mn, Fe, Co, Ni, Cu, Zn, Ga, Ge, As, 
Rb, Sr, Y, Zr, Nb, Mo, Tc, Ru, Rh, Pd, Ag, Cd, In, Sn, Sb, Cs, Ba, La, Ce, Pr, Nd, Pm, 
Sm, Eu, Gd, Tb, Dy, Ho, Er, Tm, Yb, Lu, Hf, Ta, W, Re, Os, Ir, Pt, Au, Hg, TI, Pb, Bi, 
10 Po, Fr, Ra, Ac, Th, Pa, U, Np and Pu. For example, in some embodiments varying 
amounts of metal cations may be incorporated into a duplex, such as Zn 2+ , Ni 2 *, Co 2+ , 
m Cd 2 *, Hg 2 *, Pt 2 * and Ag 1 *, where metal ions such as Cd 2 *, Hg 2 *, Pt 2 * and Ag 1 * may 

Q 

. constitute only a portion of the metal ions in the duplex, in effect 'doping' the duplex. 

M The formation of a metal-containing duplex using alternative cations under alternative 

01 15 conditions may be monitored, for example, using an ethidium bromide fluorescence 
JS assay as described in International Patent Publication WO 99/31 1 15. In some 

embodiments, conductive M-DNA may be coupled to electron donors or electron 
dp acceptors, which may for example facilitate detection of the formation of an M-DNA 

^ duplex. The electron donor may be a molecule capable of donating an electron to 

O 20 duplex, and the electron acceptor may be a molecule capable of accepting an 

electron from the duplex. 

A variety of methods have been described for the formation of channels or 
nanopores with characteristics appropriate for use in alternative embodiments of 

25 the present invention. In some embodiments, a biological pore molecule may be 
used to form a channel through which to record the process of metal-containing 
polymer translocation. Channels may for example be isolated on a membrane 
patch or inserted into a synthetic lipid bilayer. For example, US patents 5,795,782 
and 6,015,714, disclose a use of the maltoporin (LamB) pore wherein DNA is 

30 passed through the maltoporin pore, or over its opening, and resulting conductance 
changes can be measured. Alternative embodiments of the invention may utilize 
channel proteins which have pore sizes of between 3 andlOnm. For example, US 
patents 5,795,782 and 6,015,714 describe processes utilzing pores through which 
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a polymer can be drawn having dimensions of approximately. 0.5-2.0 nm for single 
stranded DNA; 1.0-3.0 nm for double stranded DNA and 1.0-4.0 nm for 
polypeptides. US patents 5,795,782 and 6,015,714 also describe examples of 
bacterial pore-forming proteins which may be used in alternative embodiments of 
5 the invention, such asGramicidin (e.g., Gramicidin A from Bacillus brevis; available 
from Fluka, Ronkonkoma, N.Y.); LamB (maltoporin), OmpF, OmpC, or PhoE from 
Escherichia coli, Shigella, and other Enterobacteriaceae, alpha-hemolysin (from S. 
aureus), Tsx, the F-pilus, lambda exonuclease, and mitochondrial porin (VDAC). 

10 It may be preferable in some embodiments to use biological pores that do 

not inactivate quickly, for example having inactivation times of more than about 500 
g\ msec. Inactivation times for pores may be natural characteristics of a selected pore, 

Q. or may for example be adapted by modification of the pore to alter domains 

N responsible for inactivation. Methods to alter inactivation characteristics of voltage 

fed 

g3 15 gated channels are known in the art (see e.g., Patton, et aL, Proc. Natl. Acad. Sci. 
Jl USA, 89: 10905-09 (1992); West, et al., Proc. Natl. Acad. Sci. USA, 89: 10910-14 

« (1992); Auld, et al., Proc. Natl. Acad. Sci. USA, 87: 323-27 (1990); Lopez, et al., 

Jj Neuron, 7: 327-36 (1991); Hoshi, et al., Neuron, 7: 547-56 (1991); Hoshi, et al., 

H Science, 250: 533-38 (1990), all hereby incorporated by reference). Alternative 

Q 20 embodiments may also use pores modified to have polymerase attachment sites 
^ (such as pores produced as recombinant fusion proteins). US patents 5,795,782 

and 6,01 5,714 describe, for example, methods in which a DNA polymerase 
molecule is fused to a pore molecule allowing the polymerase to move DNA over 
the pore's opening while recording the conductance across the pore. 

25 

To determine current modulation attributable to individual base pairs, one 
may use channels containing a limiting aperture that is much shorter than the full 
length of the overall channel (Weiss et al.). For example, one can modify alpha- 
hemolysin by standard molecular biological techniques such that portions of the 
30 pore leading to and away from the constriction are widened. Similarly, alternative 
biological pores could be modified using standard molecular biological techniques 
to modify the pore or channel to suit a specific purpose (Tapper and George, J Biol 
Chem 2001 Oct 12;276(41):38249-54; Gu et al. Science 2001 291(5504):636-40; 
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Chapman et al. J Physiol 530(1):21-33; Braha et al. Nat Biotechnol. 2000 
18(9): 1005-7). 



In alternative embodiments, non-biological channels or pores may also be 
5 provided. For example, a pore may be made in thin layers of conducting polymers 
using x-ray lithography or UV femtosecond lasers. Alternatively, a pore could be 
sculpted using Ion-beams (as described by Li et al. (Nature 2001 412(6843):166- 
169). U et al. describe a method of low energy ion-beam sculpting to produce 
nanopores in thin insulating solid state membranes (Si3N4 membrane). 
10 Additionally, mesomorphous materials such as clays, zeolites and carbon 

nanotubes may be used to create a pore or channel. In some embodiments, it may 
fji be possible to prepare and manipulate nanotubes as channels with a variety of 

Q diameters in the range of 3 - 10 nm (30 - 100 angstroms). US Patents 5,795,782 

S and 6,015,714 describe how appropriately sized physical or chemical pores or 

gj 15 channels could be induced in a water-impermeable barrier (solid or membranous) 
Jj up to a diameter of about 9 nm. Alternative methods and materials known in the art 

* for channel forming may be used, including track etching and the use of porous 

membrane templates(e.g., methods utilizing a scanning-tunneling microscope or 
J* atomic force microscope). US patents 5,795,782 and 6,015,714 also describe how 

Q 20 chemical channels or pores can be formed in a lipid bilayer using chemicals (or 

peptides) such as Nystatin, as is well known in the art of whole-cell patch clamping 
("perforated patch" technique); and peptide channels such as Alamethicin. 



In some embodiments, the methods of the invention involve measurements 
25 of ionic current modulation as the monomers (such as nucleotides) of a linear 
polymer (such as a nucleic acid molecule) pass through or across a channel or 
nanopore in a membrane. During polymer passage through or across the channel, 
ionic currents are modulated in a manner that reflects the properties of the polymer 
and the monomers. Accordingly, in some embodiments of the invention, to 
30 establish an ionic current, or to modulate the incorporation of metal ions in a 

polymer, a voltage gradient may be established across a membrane containing a 
channel through which a polymer is to be translocated. For example, US patents 
5,795,782 and 6,015,714 illustrate conductance monitoring methods that may be 
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adapted for use in various aspects of the present invention. For example, current 
fluctuation events taking place in the range of a few microseconds may be detected 
and recorded (HamilletaL, 1981, Pfluegers Arch. Eur. J. Physiol., 391: 85-100). In 
purified planar lipid bilayer systems (such as disclosed in US patents 5,795,782 and 
5 6,015,714; Wonderlin et al., supra), conductance measurements may be facilitated 
by using bHayers formed over very small diameter apertures (10-50 microns). Some 
embodiments of such techniques may have the advantage of allowing access to 
both sides of the bilayer, and may facilitate use of a larger bilayer target for 
reconstitution with the pore protein. Similarly, US Patent 6,267,872 describes 
10 methods and apparatus for producing and using single nanopores to measure ionic 
current flow through the nanopore. 

m 

^ Methods are available for determining the characteristics and conductance 

N properties of a wide variety of pore molecules (channels), which may be used in 

fp 15 various aspects of the invention(US patents 5,795,782 and 6,015,714; Sigworth et 
Jj al., supra; Heinemann et al., 1988, Biophys. J. ( 54: 757-64; Wonderlin et al., 1990, 

* Biophys. J., 58: 289-97). For example, in the pipette bilayer technique, an artificial 

Q 

$ bilayer containing at least one pore protein is attached to the tip of a patch-clamp 

*jj pipette by applying the pipette to a preformed bilayer reconstituted with the purified 

© 20 pore protein in advance. The very narrow aperture diameter of the patch pipette tip 

tu 

(2 microns) facilitates a reduction in background noise in this technique , and in 
some applications the limit for detectable current interruptions may be about 10 
microseconds (Sigworth et al., supra; Heinemann et al., 1990, Biophys. J., 57: 499- 
514). In some applications, purified channel proteins may be inserted in a known 

25 orientation into preformed lipid bilayers by standard vesicle fusion techniques 

(Schindler, 1980, FEBS Letters, 122: 77-79), or other means, and high resolution 
recordings may then be made to determine the characteristics of the channel. In 
some embodiments, the membrane surface may be oriented away from the pipette, 
to that it is accessible while recording. This may for example facilitate applications 

30 wherein the pore is introduced into the solution within the patch pipette rather than 
into the bath solution. 
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A polymer may be advanced through a nanopore channel by attaching a 
magnetic bead to the polymer (see Anazawa et al.; US Patent No. 6,136,543; Fry et 
al. Biotechniques (1992) 13(1):124-31) and by using a magnetic field to push or pull 
the bead and polymer. For example, to insert a nucleic acid molecule into a pore in 
5 some embodiments of the invention, a magnetic bead may be attached to one end 
of a duplex and biotin attached to the other end. The duplex may then be inserted 
into the pore so that a steptavidin-bead complex attaches to the biotinylated end of 
the duplex to retain the duplex in the pore. Alternatively, A polymer may be 
advanced through a nanopore or channel using electrophoretic current, for example 
10 as described in Kasianowicz et al. Proc. Natl. Acad. Sci. USA 93:13770-73; Akeson 
et al. Biophysical Journal 77:3227-33; Meller et al. Proc. Natl. Acad. Sci. USA 
pi 97(3): 1079-84; Howorka et al. Proc. Natl. Acad. Sci. USA 98(23):1 2996-1 3001. In 

^ some embodiments, a chemical potential difference may be established across an 

JJ interface or bilayer to force polymers through a channel without supplying an 

0 15 external potential difference across the membrane. In such embodiments, the 
^5 membrane potential may be varied ionically to produce more or less of a differential 

^ or "push" (see for example US patents 5,795,782 and 6,01 5,714). 

^ In some embodiments, the passage of polynucleotides through a channel 

Q 20 may be slowed to facilitate sensing individual nucleotide pairs (as described in US 
patents 5,795,782 and 6,015,714). in various embodiments, approaches to 
accomplish this may for example include: (a) increasing the viscosity of the 
medium, (b) establishing the lower limit of applied potential that will move 
polynucleotides into the channel (c) use of high processivity polymerase in the trans 
25 compartment to "pull" DNA through the pore in place of voltage gradients (d) 

adjusting the rate at which the magnetic bead coupled to the polymer is pulled or 
pushed. In some embodiments, enzymes may be used to translocate the polymer 
through the pore or channel. 

30 Although various embodiments of the invention are disclosed herein, many 

adaptations and modifications may be made within the scope of the invention in 
accordance with the common general knowledge of those skilled in this art. Such 
modifications include the substitution of known equivalents for any aspect of the 
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invention in order to achieve the same result in substantially the same way. For 
example, In alternative embodiments, lipid composition of the bilayer may include a 
wide variety of non-polar (and polar) components which are compatible with pore or 
channel protein incorporation. The configuration of recording apparatus (e.g., 
5 bilayer across aperture, micropipette patches, intra-vesicular recording) may be 
selected for such alternative embodiments such that signal detection is in an 
appropriate range. 

Numeric ranges recited herein are inclusive of the numbers defining the 
1 0 range. In the specification, the word "comprising" is used as an open-ended term, 
substantially equivalent to the phrase "including, but not limited to", and the word 
"comprises" has a corresponding meaning. Citation of references herein shall not 
ft be construed as an admission that such references are prior art to the present 

Nj invention. All publications, including but not limited to patents and patent 

| 15 applications, cited in this specification are incorporated herein by reference as if 
g each individual publication were specifically and individually indicated to be 

i incorporated by reference herein and as though fully set forth herein. The invention 

4> includes all embodiments and variations substantially as hereinbefore described 

Jjj and with reference to the examples and drawings. 

0 20 
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What is claimed is: 



1 . A process of recording information in a nucleic acid polymer, comprising: 

a) modulating the translocation of first. and second nucleic acid strands 
5 through a channel between a dissociation medium and a hybridization 

medium, while modulating an electrostatic potential across the channel to 
modulate the incorporation of a metal ion in a nucleic acid duplex as the 
duplex forms in the hybridization medium; 

b) wherein the channel separating the hybridization medium and the 
1 0 dissociation medium is dimensioned to allow lineal translocation of the 

nucleic acid duplex or a metal-containing nucleic acid duplex between the 
fi hybridization medium and the dissociation medium; 

y c) and wherein the first and the second nucleic acid strands comprise a 

SJ plurality of nitrogen-containing aromatic bases covalently linked by a 

Cfr 1 5 backbone, the nitrogen-containing aromatic bases of the first nucleic acid 

J strand being capable of being joined by hydrogen bonding in the 

k hybridization medium to the nitrogen-containing aromatic bases of the 

$ second nucleic acid strand so that the nitrogen-containing aromatic bases on 

J* the first and the second nucleic acid strands form hydrogen-bonded base 

g 20 pairs in stacked arrangement in the nucleic acid duplex, the hydrogen- 

bonded base pairs being capable of interchelating the metal cation 
coordinated to a nitrogen atom in one of the aromatic nitrogen-containing 
aromatic bases to form the metal-containing nucleic acid duplex. 

25 2. The process of claim 1 , further comprising the step of reading information 
from the nucleic acid polymer by detecting the presence or absence of the 
metal cation in the nucleic acid duplex. 

3. The process of claim 2, wherein the presence or absence of the metal cation 
30 is detected by measuring the electrical conductance across the channel as 

the nucleic acid duplex is translocated through the channel between the 
hybridization medium and the dissociation medium. 
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The process of any one of claims 1 through 3 wherein the nucleic acid 
duplex is coupled to a magnetic bead, and the translocation of the nucleic 
acid duplex through the channel is mediated by modulating the magnetic 
field across the channel. 

The process of any one of claims 1 through 4 wherein the channel is formed 
in a lipid membrane. 

The process of any one of claims 1 through 5 wherein the channel is a pore 
forming protein. 

The process of any one of claims 1 through 6 wherein the hybridization 
medium and the dissociation medium are electrically conductive aqueous 
solutions. 

The process of any one of claims 1 through 7 wherein the first and the 
second nucleic acid strands are deoxyribonucleic acids and the nitrogen- 
containing aromatic bases are selected from the group consisting of 
adenine, thymine, guanine and cytosine. 

The process of any one of claims 1 through 8 wherein the metal cation is 
selected from the group consisting of Zn 2+ , Co 2+ , and Ni 2+ . 

The process of any one of claims 1 through 9 wherein the metal cations are 
substituted for imine protons of the nitrogen-containing aromatic bases, and 
the nitrogen-containing aromatic bases are selected from the group 
consisting of thymine and guanosine. 

The process of any one of claims 1 through 10 wherein at least one of the 
aromatic nitrogen-containing aromatic bases Is thymine, having an N3 
nitrogen atom, and the metal cation is coordinated by the N3 nitrogen atom. 
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The process of any one of claims 1 through 1 1 wherein at least one of the 
aromatic nitrogen-containing aromatic bases is guanine, having an N1 
nitrogen atom, and the metal cation is coordinated by the N1 nitrogen atom. 

A process for detecting a base pair mismatch in a nucleic acid polymer, 
comprising: 

a) detecting the presence or absence of divalent metal cations in base 
pairs of a nucleic acid duplex by measuring electrical conductance 
across a channel as a metal-containing nucleic acid duplex is 
translocated through the channel; 

b) wherein the channel separates a first pool and a second pool of a 
medium, the channel being dimensioned to allow sequential 
monomer-by-monomer lineal translocation of the metal-containing 
nucleic acid duplex between the first and second pools of the 
medium; 

c) . wherein the nucleic acid duplex comprises a first strand of nucleic 

acid and a second strand of nucleic acid, the first and the second 
nucleic acid strands comprising a plurality of nitrogen-containing 
aromatic bases covalently linked by a backbone, at least some of the 
nitrogen-containing aromatic bases of the first nucleic acid strand 
matching the nitrogen-containing aromatic bases of the second 
nucleic acid strand and the matching base pairs being joined by 
hydrogen bonding to the nitrogen-containing aromatic bases of the 
second nucleic acid strand, the matching nitrogen-containing 
aromatic bases on the first and the second nucleic acid strands 
forming hydrogen-bonded base pairs in stacked arrangement along 
the length of the nucleic acid duplex; 

d) wherein the matching hydrogen-bonded base pairs of the metal- 
containing nucleic acid duplex comprise an interchelated divalent 
metal cation coordinated to a nitrogen atom in one of the aromatic 
nitrogen-containing aromatic bases, and wherein a mismatched base 
pair does not interchelate a divalent metal cation. 
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A device for storing information comprising a metal-containing nucleic 
acid duplex housed in the lumin of a channel formed in a membrane. 
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ABSTRACT 

In various aspects, the invention provides molecular systems for storing and 
retrieving information. In some embodiments, polymers capable of selectively 
binding metal ions, such as nucleic acids, are used to record information in the 
5 form of a particular molecular conformation. Electrochemical assay methods 
employing nanopores may be used to read and write information using such 
polymeric media. In some embodiments, information may be recorded in a nucleic 
acid polymer by magnetically modulating the translocation of the nucleic acid 
through a channel in a medium, while simultaneously modulating the electrostatic 
10 potential across the channel. In this way, the incorporation of a divalent metal ion in 
the nucleic acid duplex may be modulated to store information. In an alternative 
aspect, the invention provides analogous processes for detecting a base pair 
mismatch in a nucleic acid polymer. 
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Figure 1 
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Figure 3 
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Multiplexing: 

Length 

20 

20 

20 

21 
21 
etc. 



Position 
10 or 11 
9 or 12 
Bor 13 

11 

lOor 12 



Mutation 
cystic fibrosis 
Tay Sach's 
Muscular dystrophy 

Hemophilia A 
Hemophilia B 



Since the orientation of any duplex is unknown then e.g. position 10 and 11 
for a 20-mer cannot be distinguished. 
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